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(57) Abstract: A data processing 
structure comprising a chain of 
data blocks containing a binary tree 
structure; each of said data blocks 
containing a predetermined number 
of nodes. 
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DATA PROCESSING STRUCTURE 

The present invention relates to a data processing 
structure and, more particularly, to such a data structure 
suited to simultaneously representing work to do and work 
5 done . 

BACKGROUND 

In the field of data processing there are many cases 
where a system agent needs to iteratively walk object 
10 subgraphs and perform some processing on all the objects 
tound within the subgraph. Starting from the root of the 
subgraph the agent walks the subgraph and discovers more 
objects which are not in the subgraph yet need to be 
processed. 

15 in doing this kind of processing the agent needs some 

way to represent 11 work to do" (i.e. objects yet to be 
walked/processed) and be able to test if an object has been 
processed (i.e. objects that have been walked/processed) so 
as to identify and avoid cycles. 

20 Typically a structure such as a binary tree or sparse 

array is used to support this kind of processing. 
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Often when an agent is processing a set of objects it 
requires a structure that enables it to record either what 
objects have or have not yet been processed* 

However it turns out in practice that a binary tree or 
5 sparse array is rather ineffective for representing "work to 
do" . The difference between "has/has not been processed*' and 
"work to do" may at first appear subtle , but they are 
eemantically different. The former is basically a testable 
proposition on a given handle while the later is a mutable 
10 indefinite set handles . 

The performance of binary trees and sparse arrays is 
particularly poor when the following three things happen: 

1. The structure represents work to do; 

2. The structure has to be linearly scanned to 
15 discover what work to do nexC; and 

3. While processing work further work is discovered 
(more objects are inserted into Lhe structure) 
therefore requiring the repetition of steps 2 & 3 ) . 

The fundamental reason why steps 2 & 3 must be repeated 
20 is due to the nature /structure of the binary trees and sparse 
arrays. If an object appears in such a structure it always 
appears in a particular place. 

Scanning a binary tree or sparse array can be done depth 
or breadth first/ but there is always some traversal cursor 
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moving through the nodes. If new values are inserted into 
the binary tree or sparse array (because of processing the 
handle already in the structure) , they are inserted with 
indifference to the traversal cursor. The upshot is that the 
5 structure must be rescanned until there is no further work. 

It is an object of the present invention to overcome or 
ameliorate one or more of the above mentioned disadvantages. 

BRIEF DESCRIPTION OF INVENTION 

10 Accordingly in one broad from of the invention there is 

provided a data processing structure comprising a chain of 
data blocks containing a binary tree structure; each of said 
data blocks containing a predetermined number of nodes. 

In yet a further broad form of the invention there is 
15 provided a data processing structure comprising one or more 
node blocks and a control structure; said control structure 
grouping nodes in said node blocks according to a physical 
ordering in computer memory. 

Preferably each said node block contains a pointer to 
20 the next node block and an array of binary tree nodes. 

Preferably said binary tree node contains a value, a 
left pointer and a right pointer. 

Preferably said control structure includes the 
following: 
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(a) the number of nodes, in a block; 

(b) a pointer to the first block; 

(c) a pointer to the last block; 

(d) a pointer to the current append block; 

5 (e) a pointer to last node appended in the current 

append block; 

(f) the number of free nodes in the current append 
block . 

In yet a further broad form of the invention there is 
10 provided a method of storage of a binary tree structure , said 
method comprising storing nodes of said tree in node arrays; 
each said node array comprising a predetermined number of 
node storage locations; said node storage locations of said 
predetermined number of node storage locations having a 
15 predetermined physical ordering in memory . 

In yet a further broad form of the invention there is 
provided a method of operation on the structure wherein a 
traversal cursor is utilized to keep track of work done; said 
cursor being advanced in a linear progression through said 
20 data processing structure as each node in said structure is 
processed whereby nodes through which said cursor has 
advanced do not require further processing. 

In yet a further broad form of the invention there is 
provided a method of processing a directed graph wherein as 
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further work to do is discovered in the course of progressing 
a traversal cursor through said graph, the work to do is 
inserted as a node ahead of the current location of the 
traversal cursor. 

5 

BRIEF DESCRIPTION OF DRAWINGS 

One embodiment of the present invention will now be 
described with reference to the accompanying drawings 
wherein : 

10 Fig. 1 represents a binary tree structure to which 

embodiments of the present invention can be applied; 

Fig. 2 illustrates an instance of a data processing 
structure according to a first preferred embodiment; 

Fig. 3 illustrates operations on the data processing 
15 structure of Fig. 2; and 

Fig. 4 illustrates in block diagram form further detail 
of the data processing structure of Fig. 2. 

DETAILED DESCRIPTION OF PREFERRED EMBODIMENT 

20 When used to represent *work to do" binary trees and 

sparse arrays have the following performance characteristics: 

1. 0(1) test time; 

2. 0(1) insert time; BUT 

3. 0(N log N) ... 0(N 2 ) scan time - depending upon usage. 
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With reference to Figs. 1, 2 and 3, a data processing 
structure 10 according to a first embodiment of the present 
invention comprises logically a binary tree 11 but physically 
held within a block chain - where each block holds a set of 
5 nodes . 

With reference to Fig. 1 assume we have the logical 
binary tree 11 comprising a plurality of nodes 12-22 as 
illustrated. 

Let's also assume we had added the nodes 12-22 in the 
10 following order: 12, 13, 14, 15, 16, 17, 18, 19, 20, 21 and 
then 22. 

Again for the purposes of our discussion let us assume 
that our block can hold four nodes. 

Therefore the structure 10 will appear as in Fig. 2. 
15 Now if we add two new nodes 23, 24 with reference to 

Fig. 3, the tree structure 11 is represented as illustrated. 

Observe that when a value is added into the structure 10 
it is physically appended at the end of the node sequence and 
then the logical binary tree 11 is updated.' It will be 
20 observed that structure 10 is such that the nodes have a 
physical ordering in memory. 

By having a physical ordering in memory the structure 10 
can be linearly (and efficiently) traversed and while being 
traversed new values can be added, such that all new values 
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appear physically ahead of a traversal cursor. Values ahead 
of the traversal cursor represent work to be done, whilst 
values behind the traversal cursor represent work already 
done • 

5 This property of adding new values ahead of the 

traversal cursor means the client application (not shown) can 
add/record further work to do. Advancing the traversal 
cursor eventually reaches the new work to be done. 

Having multiple nodes in a block means minimal memory 
10 allocation. 

The structure can be reused by resetting the pointers 
(thereby keeping the blocks if this is efficient), or can be 
deleted outright. 

The structure is extensible so it can be used to hold an 
15 indefinite amount of work. 

Because the structure 10 contains a binary tree 11, a 
value can be quickly tested to see if it is in the structure. 

With particular reference to Figs. 2 and 3, the data 
processing structure 10 comprises: 
20 a. One or more node blocks; and 

b. A control structure. 
Each node block contains: 

a. A pointer to the next node block (thereby forming a 
chain of node blocks) ; and 
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b. An array of traditional binary tree nodes (i.e. a 
value, a left pointer and a right pointer) . 

The control structure has at least six members: 
a. The number of nodes in a block; 
5 b. A pointer to the first block; 

c. A pointer to the last block; 

d. A pointer to the current append block; 

e. A pointer to last node appended in the current 
append block; and 

10 f. The number of free nodes in the current append 

block. 

The data processing structure 10 can allow the following 
operations : 

1. Create - creates an entirely new data processing 
15 structure 10; 

2. Destroy - completely destroys an existing data 
processing structure 10; 

3. Add - adds an item into a data processing structure 
10; 

20 4. Test - tests if an item is in a data processing 

structure 10; 

5. First - returns the first item in a data processing 
structure 10; 
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6. Next - returns the next item in a data processing 
structure 10 (given/passed the last one returned 
via first or next) ; and 

7. Zero - remove all items from the data processing 
5 structure 10 and resets it but leaves the existing 

chain of node blocks in place for reuse. 
In this embodiment the data processing structure 10 has 
the following properties: 



1. 


Can be linearly scanned; 


2. 


Further work is inserted ahead of scan; 


3. 


0(1) insert time; 


4. 


0 (Log N) test time; 


5. 


0(N) scan time; 


6. 


Reusable; 


7. 


Extensible; 


8. 


Minimal number of memory allocations ; 


9. 


High water mark of largest usage to date; 


10. 


Easily discarded; 


11. 


Readily tunable. 



20 With particular reference to Fig. 4 data processing 

structure 10 is described in further detail and with 
particular reference to the method of cursor progression 
through the structure . 
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Data processing structure 10 comprises, in this 
instance, first node block 30 , second node block 31 and third 
node block 32. Each of node blocks 30, 31, 32 comprises an 
array of four nodes 33 together with a pointer 34 to the next 

5 node block. Each node 33 includes a value 35, a left pointer 
36 and a right pointer 37. In this instance each node block 
30, 31, 32 comprises exactly four nodes. 

A control structure 38 is associated with the data 
processing structure 10 and includes pointer 39 to first node 

10 block 30, pointer 40 to the current append block (in this 
instance node block 32) , pointer 41 to last node block (in 
this instance third node block 32) . 

In addition the control structure 38 includes storage 
location 42 which contains the number of free nodes in the 

15 current append block (in this instance there are two spare 
nodes, namely spare node 42 and spare node 43 in the third 
node block 32) . 

In use a cursor 44 is moved through the control 
structure 10 starting with the first node of first node block 

20 30, progressing, in this instance, from left to right through 
all of the nodes in first node block 30, then progressing to 
the first node of second node block 31, progressing through 
all the nodes of second node block 31 (again from left to 
right) and then, finally, in this instance, progressing 
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through the two nodes of third node block 32. If, in the 
course of progression, the cursor 34 discovers further nodes 
for processing such nodes are appended to the next available 
node in the current append block as designated by pointer 40. 
5 Because of the structure of data processing structure 10 such 
additional nodes will always be placed after the current 
position of the cursor 44. It follows that, at any given 
instant, all nodes through which cursor 44 has progressed 
have been processed. It follows also that any nodes 

10 requiring work still to be done will always be found ahead of 
the current cursor position. In practice the number of nodes 
set aside for each node block 30, 31, 32 may be different 
from node block to node block. However, the number of nodes 
in any given node block which is set up will always be known 

15 in advance of the creation of that node block and as 
determined by control structure 38. In one typical example 
the first node block could, in practice, contain 256 nodes 
with subsequent node blocks each containing double the number 
of nodes of the previous node block. 

20 The binary tree or sparse array implemented as the data 

processing structure of the preferred embodiment is an 
extremely useful and efficient structure for representing 
both work done; work to do and recording further work to do 
while doing work. 
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The above describes only some embodiments of the present 
invention and modifications, obvious to those skilled in the 
art, can be made thereto without departing from the scope and 
spirit of the present invention. 



5 
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CLAIMS 

1. A data processing structure comprising a chain of data 
blocks containing a binary tree structure; each of said 
data blocks containing a predetermined number of nodes. 
5 2. A data processing structure comprising one or more node 
blocks and a control structure; said control structure 
grouping nodes in said node blocks according to a 
physical ordering in computer memory. 

3 . The data processing structure of Claim 2 wherein each 
10 said node block contains a pointer to the next node 

block and an array of binary tree nodes. 

4 . The data processing structure of Claim 3 where each said 
binary tree node contains a value, a left pointer and a 
right pointer. 

15 5 . The data processing structure of Claim 2 wherein said 
control structure includes the following: 

(a) the number of nodes in a block; 

(b) a pointer to the first block; 

(c) a pointer to the last block; 

20 (d) a pointer to the current append block; 

(e) a pointer to last node appended in the current 
append block; 

(f) the number of free nodes in the current append 
block. 
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6. A method of storage of a binary tree structure, said 
method comprising storing nodes of said tree in node 
arrays; each said node array comprising a predetermined 
number of node storage locations; said node storage 

5 locations of said predetermined number of node storage 

locations having a predetermined physical ordering in 
memory. 

7. A method of operation on the structure of any one of 
claims 1 to 5 wherein a traversal cursor is utilized to 

10 keep track of work done; said cursor being advanced in a 

linear progression through said data processing 
structure as each nodie in said structure is processed 
whereby nodes through which said cursor has advanced do 
not require further processing. 

15 8 . A method of processing a directed graph wherein as 
further work to do is discovered in the course of- 
progressing a traversal cursor through said graph, the 
work to do is inserted as a node ahead of the current 
location of the traversal cursor. 

20 
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